智能论文笔记

Reducing Down(stream)time: Pretraining Molecular GNNs using Heterogeneous AI Accelerators

Jenna A. Bilbrey , Kristina M. Herman , Henry Sprueill , Soritis S. Xantheas , Payel Das , Manuel Lopez Roldan , Mike Kraus , Hatem Helal , Sutanay Choudhury

分类：机器学习

2022-11-08

The demonstrated success of transfer learning has popularized approaches that involve pretraining models from massive data sources and subsequent finetuning towards a specific task. While such approaches have become the norm in fields such as natural language processing, implementation and evaluation of transfer learning approaches for chemistry are in the early stages. In this work, we demonstrate finetuning for downstream tasks on a graph neural network (GNN) trained over a molecular database containing 2.7 million water clusters. The use of Graphcore IPUs as an AI accelerator for training molecular GNNs reduces training time from a reported 2.7 days on 0.5M clusters to 1.2 hours on 2.7M clusters. Finetuning the pretrained model for downstream tasks of molecular dynamics and transfer to a different potential energy surface took only 8.3 hours and 28 minutes, respectively, on a single GPU.

translated by 谷歌翻译

Decoding the Protein-ligand Interactions Using Parallel Graph Neural Networks

Carter Knutson , Mridula Bontha , Jenna A. Bilbrey , Neeraj Kumar

分类： (统计)机器学习 | 机器学习

2021-11-30

蛋白质 - 配体相互作用（PLIS）是生化研究的基础，其鉴定对于估计合理治疗设计的生物物理和生化特性至关重要。目前，这些特性的实验表征是最准确的方法，然而，这是非常耗时和劳动密集型的。在这种情况下已经开发了许多计算方法，但大多数现有PLI预测大量取决于2D蛋白质序列数据。在这里，我们提出了一种新颖的并行图形神经网络（GNN），以集成PLI预测的知识表示和推理，以便通过专家知识引导的深度学习，并通过3D结构数据通知。我们开发了两个不同的GNN架构，GNNF是采用不同特种的基础实现，以增强域名认识，而GNNP是一种新颖的实现，可以预测未经分子间相互作用的先验知识。综合评价证明，GNN可以成功地捕获配体和蛋白质3D结构之间的二元相互作用，对于GNNF的测试精度和0.958，用于预测蛋白质 - 配体络合物的活性。这些模型进一步适用于回归任务以预测实验结合亲和力，PIC50对于药物效力和功效至关重要。我们在实验亲和力上达到0.66和0.65的Pearson相关系数，分别在PIC50和GNNP上进行0.50和0.51，优于基于2D序列的模型。我们的方法可以作为可解释和解释的人工智能（AI）工具，用于预测活动，效力和铅候选的生物物理性质。为此，我们通过筛选大型复合库并将我们的预测与实验测量数据进行比较来展示GNNP对SARS-COV-2蛋白靶标的实用性。

translated by 谷歌翻译

SLATE: A Sequence Labeling Approach for Task Extraction from Free-form Inked Content

Apurva Gandhi , Ryan Serrao , Biyi Fang , Gilbert Antonius , Jenna Hong , Tra My Nguyen , Sheng Yi , Ehi Nosakhare , Irene Shaffer , Soundararajan Srinivasan

分类：自然语言处理 | 机器学习

2022-11-08

We present SLATE, a sequence labeling approach for extracting tasks from free-form content such as digitally handwritten (or "inked") notes on a virtual whiteboard. Our approach allows us to create a single, low-latency model to simultaneously perform sentence segmentation and classification of these sentences into task/non-task sentences. SLATE greatly outperforms a baseline two-model (sentence segmentation followed by classification model) approach, achieving a task F1 score of 84.4\%, a sentence segmentation (boundary similarity) score of 88.4% and three times lower latency compared to the baseline. Furthermore, we provide insights into tackling challenges of performing NLP on the inking domain. We release both our code and dataset for this novel task.

translated by 谷歌翻译

Disparate Censorship & Undertesting: A Source of Label Bias in Clinical Machine Learning

Trenton Chang , Michael W. Sjoding , Jenna Wiens

分类：机器学习

2022-08-01

随着机器学习（ML）模型在临床应用中获得吸引力，了解临床医生和社会偏见对ML模型的影响越来越重要。尽管用于模型训练的标签可能会出现偏见，但这些偏见的许多来源尚未得到充分研究。在本文中，我们重点介绍了不同的审查制度（即，患者组的测试率差异）是临床ML模型可能会放大的标签偏差来源，可能造成损害。许多患者风险分层模型都使用标签的临床医生诊断和实验室测试的结果进行培训。没有测试结果的患者通常会分配负标签，该标签假设未经测试的患者没有经历结果。由于订单受到临床和资源考虑因素的影响，因此在患者人群中进行测试可能不统一，从而导致不同的审查制度。同等风险患者的不同审查制度会导致某些组的承诺，进而对此类组的有偏见的标签进行审查。在标准ML管道中使用此类偏见的标签可能会导致患者组的模型性能差距。在这里，我们从理论和经验上表征了不同的条件，在这些条件下，不同的审查制度或承诺会影响跨亚组的模型绩效。我们的发现呼吁人们注意不同的审查制度，作为临床ML模型中标签偏差的来源。

translated by 谷歌翻译

GEMv2: Multilingual NLG Benchmarking in a Single Line of Code

Sebastian Gehrmann , Abhik Bhattacharjee , Abinaya Mahendiran , Alex Wang , Alexandros Papangelis , Aman Madaan , Angelina McMillan-Major , Anna Shvets , Ashish Upadhyay , Bingsheng Yao

分类：自然语言处理 | 人工智能 | 机器学习

2022-06-22

通常通过过去的选择来告知机器学习中的评估，例如要使用哪些数据集或指标。该标准化可以使用排行榜对平等基础进行比较，但是随着出现更好的替代方案，评估选择变得不佳。这个问题在自然语言生成中尤其相关，该语言需要不断改善的数据集，指标和人类评估以提出确定性的主张。为了使遵循最佳模型评估实践更加容易，我们介绍了GEMV2。新版本的一代，评估和指标基准为数据集，模型和指标开发人员提供了模块化基础架构，以使彼此受益。GEMV2支持40种记录的数据集中51种语言。所有数据集的模型都可以在线评估，我们的交互式数据卡创建和渲染工具使得在Living Benchmark中添加新数据集变得更加容易。

translated by 谷歌翻译

Projecting Robot Navigation Paths: Hardware and Software for Projected AR

Zhao Han , Jenna Parrillo , Alexander Wilkinson , Holly A. Yanco , Tom Williams

分类：机器人

2021-12-09

对于移动机器人，移动机械手和自治车辆，以安全地在街道和仓库等人口众多的地方驾驶，人类观察者必须能够理解他们的导航意图。启用这种理解的一种方法是通过在周围环境上的投影来可视化这一意图。但尽管存在此类预测的有效性，但不存在具有集成硬件设置的开放式代码库。在这项工作中，我们详细介绍了这种定向预测的有效性的经验证据，并使用广泛使用的机器人操作系统（ROS）和RVIZ在C ++中分享了这种预测的机器人无关的实施。此外，我们使用获取机器人演示用于部署此软件的硬件配置，并简要概括激励此配置的全尺寸用户学习。代码，配置文件（Roslaunch和RVIZ文件）以及文档在Github上自由地提供HTTPS://github.com/umhan35/Arrow_Projection。

translated by 谷歌翻译

A Dataset of Stationary, Fixed-wing Aircraft on a Collision Course for Vision-Based Sense and Avoid

Jasmin Martin , Jenna Riseley , Jason J. Ford

分类：机器人 | 计算机视觉

2021-12-06

预计将在2026年促使新兴的无人机航空公司（UAV）服务市场达到584亿美元，促使常规将常规无人机运营促进到国家空域中的重大努力，以至于它们不会损害现有的安全水平。通过感觉和避免潜在的中空碰撞威胁，将提高无人机的商业用途，但是在缺乏可用的数据集时，该领域的研究是缺乏可用的数据集，因为它们昂贵且技术上是为了捕获。在本文中，我们为基于视觉的飞机检测提供了一个数据集。 DataSet由15个图像序列组成，其中包含55,521张固定翼飞机的图像，接近固定式接地的摄像头。还提供了地面真理标签和绩效基准。为了我们的知识，这是第一个在碰撞课程上学习中型固定翼飞机的第一个公共数据集。完整的数据集和地面真理标签在https://qcr.github.io/dataset/aircraft -collision-.c资料/航空公司

translated by 谷歌翻译

A Tutorial on Parametric Variational Inference

Jens Sjölund

分类： (统计)机器学习 | 机器学习

2023-01-03

Variational inference uses optimization, rather than integration, to approximate the marginal likelihood, and thereby the posterior, in a Bayesian model. Thanks to advances in computational scalability made in the last decade, variational inference is now the preferred choice for many high-dimensional models and large datasets. This tutorial introduces variational inference from the parametric perspective that dominates these recent developments, in contrast to the mean-field perspective commonly found in other introductory texts.

translated by 谷歌翻译

A Survey On Few-shot Knowledge Graph Completion with Structural and Commonsense Knowledge

Haodi Ma , Daisy Zhe Wang

分类：自然语言处理 | 人工智能 | 机器学习

2023-01-03

Knowledge graphs (KG) have served as the key component of various natural language processing applications. Commonsense knowledge graphs (CKG) are a special type of KG, where entities and relations are composed of free-form text. However, previous works in KG completion and CKG completion suffer from long-tail relations and newly-added relations which do not have many know triples for training. In light of this, few-shot KG completion (FKGC), which requires the strengths of graph representation learning and few-shot learning, has been proposed to challenge the problem of limited annotated data. In this paper, we comprehensively survey previous attempts on such tasks in the form of a series of methods and applications. Specifically, we first introduce FKGC challenges, commonly used KGs, and CKGs. Then we systematically categorize and summarize existing works in terms of the type of KGs and the methods. Finally, we present applications of FKGC models on prediction tasks in different areas and share our thoughts on future research directions of FKGC.

translated by 谷歌翻译

Reference Twice: A Simple and Unified Baseline for Few-Shot Instance Segmentation

Yue Han , Jiangning Zhang , Zhucun Xue , Chao Xu , Xintian Shen , Yabiao Wang , Chengjie Wang , Yong Liu , Xiangtai Li

分类：计算机视觉

2023-01-03

Few Shot Instance Segmentation (FSIS) requires models to detect and segment novel classes with limited several support examples. In this work, we explore a simple yet unified solution for FSIS as well as its incremental variants, and introduce a new framework named Reference Twice (RefT) to fully explore the relationship between support/query features based on a Transformer-like framework. Our key insights are two folds: Firstly, with the aid of support masks, we can generate dynamic class centers more appropriately to re-weight query features. Secondly, we find that support object queries have already encoded key factors after base training. In this way, the query features can be enhanced twice from two aspects, i.e., feature-level and instance-level. In particular, we firstly design a mask-based dynamic weighting module to enhance support features and then propose to link object queries for better calibration via cross-attention. After the above steps, the novel classes can be improved significantly over our strong baseline. Additionally, our new framework can be easily extended to incremental FSIS with minor modification. When benchmarking results on the COCO dataset for FSIS, gFSIS, and iFSIS settings, our method achieves a competitive performance compared to existing approaches across different shots, e.g., we boost nAP by noticeable +8.2/+9.4 over the current state-of-the-art FSIS method for 10/30-shot. We further demonstrate the superiority of our approach on Few Shot Object Detection. Code and model will be available.

translated by 谷歌翻译